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Abstract: An exact expression for the determinant of the splitting matrix is derived: it 
allows us to analyze the asymptotic behaviour needed to amend the large angles theorem 
' proposed in Ann. Inst. H. Poincare, B-60, 1, 1994. The asymtotic validity of Mei- 

nokov 's formulae is proved for the class of models considered, which include polynomial 
perturbations. 
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D' §1. Introduction. 

c/^ : 

' Recently V. Gelfreich noted that a "theorem" in [CG] contains an error. The theorem 

I gave a lower bound on the splitting angles in a three degrees of freedom system and it was 

• needed to show the existence of heteroclinic chains in a class of hamiltonian systems with 

^ I the aim of an apphcation to a celestial mechanics problem. We correct it here by providing 

the correct lower bound and, at the same time, exposing again and in a more meditated 
[ form some ideas of [CG]. In the present paper we do not discuss the existence of Arnold's 

Q^ ' diffusion, [A2], in our systems. We do not discuss the celestial mechanics apphcation of 

' [CG] either, as two parts of it (see below) relied on the erroneous statement, and more 

work is needed. The present paper is, therefore, not a correction of the implications of 
the error in [CG] but only of the error itself. In order to derive the same implications 
Ch ' further work is necessary as the erroneous result was used several times in the last three 

sections of [CG]. Each use has therefore to be treated separately. 

Our analysis deals mainly with systems with three different time scales with the ratio 
O , between the smallest to the largest being ^ 1: e. g. the largest is f]^^^'^, the intermediate 

^ ' is 1 and the smallest is ry", a > O (the a = O case being a limiting two scales problem). 

^ , The result will be called the large angles theorem, (theorem 2 in §5, proved in §6, §7 

for the systems defined in (2.1), (7.1) below), and describes a property of a pendulum 
subject to both a slow periodic force and to a rapid periodic force (incommensurate to 
' the former) that we imagine generated by a pair of rotators, whose positions are given 

by two angles a, A while the pendulum position is given by an angle ip. The angles a, A 
will be respectively called slow and fast. 

When the system is perturbed some of the quasi periodic motions performed by rotators 
(or clocks) while the pendulum is in its unstable equilibrium persist. They become 
motions in which the pendulum is slightly moving quasi periodically around its unstable 
position ("without ever falling down"): one has therefore an invariant 2-dimensional 
torus in phase space. For each position (a. A) of the rotators there is well defined value of 
the pendulum variables (/, (p) as well as of the rotators actions {A, B). And (a. A) evolve 
quasi-periodically {i. e. (a. A) can be written as a = V'! + (V'! j "02)1 A = V'2 + A2 ('i/'i, ^02) 
and the time evolution is simply linear ip — > ip + uj_t for a suitable lo). 
Obviously the above quasi periodic motion is unstable and perturbations will make the 
pendulum "fail" . The 2-dimensional invariant tori have therefore stable and unstable 
manifolds: which coincide in absence of perturbations (because they correpond to the 
separatrix motions and the pendulum separatrix is degenerate). However under per- 
turbations they "split" (i. e. become distinct 3 dimensional surfaces). Often, simply by 
symmetry, there is one trajectory that lies on both manifolds. Physically it corresponds 
to a motion that consists of just one swing of the pendulum from nearby the unstable 
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position back to it. 

If we observe the swinging trajectory at the moment it passes through the stable equi- 
librium position, <f = tt, we see just a point that can be identified by the value a^, at 
that moment, of the a coordinatcs (whcro tho two manifolds meet). If wc movc away 
from that point the A coordinates on the two manifolds become difFerent, at the same 
a, and their difference is a "spUtting" vector Q (a). The 2x2 matrix -D = 9q Q («o) 
is callcd the intersection matrix. 

The homochnic angles can be defined as the angles whose tangents are the eigenvalues 
of the intersection matrix. The spUtting is usually defined as detD. When the perturb- 
ing frequcncics are hcld fixcd and the pcrturbation is scnt to O thcrc is a wcU known 
asymptotic expression for the sphtting, called Melnikov's formula and coinciding with 
the "first order perturbation theory result" . 

The sphtting problem is to find undcr which conditions Melnikov 's formula holds when 
both the perturbation and the shortest forcing period are sent to 0. 
Informally our result is (see theorem 2 in §5 for a formal statement): 

Suppose that the forcing freguency is very large, say i]^^/"^ times the pendulum f requency, 
with rj very small. Suppose also that the second characteristic freguency is relatively 
very sm,alL say rj"" times that of the pendulum with a > 0. Then there are perturhations 
of size e = 0{rf) (for some c > Oj such that the separatrix spUtting has size given 
asymptotically by Melnikov's formMla provided the freguencies of the motion verify a 
suitable diophantine condition. In fact this property holds generically, under the same 
conditions, for perturbations of size e = 0{t}'') which are trigonometric polynomials. 

There are many examples of systems for which the above property does not hold, see 
[S], [DJGS], [RW]. But the forcing freguencies relations are different. 

The corrcct lowcr bound cstimatc on the sphtting makes an analysis to all orders ncc- 
essary: the analysis, performed here, becomes "marginal" but the final result on the 
existence of homoclinic splitting {i.e. a lower bound on it) remains valid. Therefore the 
prcscnt papcr corrccts the error in §10 of [CG] as far as its implications on the size of 
the splitting are concerned. 

The techniques we use here to bypass perturbation theory were started in the Appendix 
A13 of [CG]: thcy were not pushed too far because the error made any developments 
unnecessary for the purposes of [CG]. 

The techniques were subsequently developed in [G3] (which does not repeat the com- 

putational error). The mcthods of [G3] were not developed to trcat threc timc scales 
problems: the aim there being to prove the smallness of the splitting in two time scales 
problems (i.e. in systems not considered here with all rotators with comparably large 
velocity, a = — |)- But thcy can bc easily cxtendcd to threc timc scales problems and 
even lead to a remarkable nonperturbative and exact computation of the leading order 
(exponentially small) of the intersection matrix, see (6. 12), (7. 19): V. Gelfreich stressed 
neccssity of a nonperturbative analysis by a simplc cogcnt argumcnt. 
The reason why we study three time scales systems is simply that they arise in a celestial 
mechanics problem of interest to us, see §12 of [CG]. But most of our results hold also if 
the slow time scale is the same as that of the pendulum, i.e. the parameter a abovc (see 
also (2.1)) is a = O, see also comments in §9: this is also an interesting case and it has 
been considered, to some extent, in [RW]. 

The recent work of [RW] considerably overlaps with ours: even when a = O (a two time 
scales problem) it cannot be used to achieve all our results because our assumptions can 
violate eq. (15) of [RW] {i.e. the bound on the constant called b in [RW], see (2.2), below, 
for the isochronous case and (7.3) for the anisochrounus) . We also require, as an essential 
assumption, that the perturbations be trigonometric polynomials while the examples of 
[RW] require, as an essential assumption, that the perturbation has infinitely many non 
vanishing modes (see eq. (19) of [RW]). Another essential difference with respect to [G3] 
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and [RW] is that wc only study thc splitting at the hom,oclinic point whilc thcy study 
it everywhere: hence our work is much more limited in scope. On the other hand we 
recover some of the results of [RW] because our proofs also apply nontrivially to two time 
scale cascs with 3 dcgrccs of freedom (thc oncs in [RW]). Ncvcrtheless we feel that thc 
main difference between our work and [RW] hes in the techniques: here we show that 
the techniques of [G3] do apply immediately to the problem. See concluding remarks for 
a more technical comparison. 

§2. Isochronous clock— pendulum system. 

Our main result concerns anisochronous systems, (7.1). Isochronous models will be 
considcrcd only to illustrate the simplcst cascs: thc canccUations that wc find in thc 
anisochronous case would look miraculous otherwise. Calling {I, (f ), {A, a), (-B, A) pairs 
of canonical coordinates suppose them action-angle variables: {(f, a, A) is a triple of 
anglcs (varying on the 3-dimensional torus = [O, 27r]^) and (/, A, B) G R^. 

The hamiltonian will be: 

i/ = r?M+y +r?-i/2B+ 52(cosvp-l) + Mr?'/(<^,a,A), e=^ m" (2.1) 

where 77, /x > O and / is an even trigonometric polynomials in the angles {(p, a, A), i. e. for 
instance one could take f{ip, a, A) = ( cos(a + cp) + cos(A + (p)) . The parameter jj sets 
thc ratios between time scales and is a frec parameter which we take, eventually, to be 
close to zero in order to study asymptotic properties as rj ^ 0. 

If /i = O the 2-dimensional torus: A = B = I = 0, ip = and a = (a, A) arbitrary, 
is invariant and run quasi-periodically with rotation velocity w = (ry", ry~^/^): it will be 
called the unperturbed torus. 

In the above case, i.e. in the isochronous case only, we shall suppose (not "for simplic- 
ity": but as an essential hypothesis) that the vector lo = (r?", r?~^^^) is a diophantine 
vector: 

k • i^l > C-^7?''|l/|-^ (2.2) 

for somc diophantine constant C, r and somc d > 0. This restricts thc vahics of t/ that 
we can consider. For a € [0,5] it still allows sequences rjk, such that (2.2) holds with 
prefixed t > 2,d = a and C large enough, with rj = rjk = r]ik~^, for some 771 G [5, 1] and 
all integers k large enough. 

Remarks: (1) A related H with a ~ \ ariscs in a celestial mechanics problem ncar a double 
resonance, responsiblc for the time scales diffcrcnccs (scc (12.39) in [CG], aftcr scaling 
away the factors lut to put H in dimensionless form and with several factors replaced here 
by constants, for simplicity). A first simplification of (2.1) compared to the "realistic" 
model in §12 of [CG] is thc abscncc of an additivc isochrony brcaking term ?7^: taking 
it into account does not essentially change the analysis of the splitting results (even in 
its quantitative aspects on the asymptotics as 77 — > O, see §7). A second simplification 
is the abscncc in (2.1) of a further pcrturbation /3 /o which is not small {i.e. /o,/? are 
77, /x-independent) but depends only on the "fast" angle A and on (p: f o = fQ{(p,X). 
Taking it into account is a problem not discussed here. However it does not change the 
qualitativc aspects but only thc quantitative ones as long as the system is isochronous, 
see the discussion at the end of §8. 

(2) in the above hamiltonian is fixed [i.e. it is r}, jj, independent): eventually we take 
g = I for simplicity. Thc paramctcrs /i, 77 are free and wc shall bc intcrcstcd in them 
having a "small value" : note that if 77 ^ O the rotation vector uj of the unperturbed 
torus (I = (p = O, B = A = 0) has a size tending to 00. The even parity assumptions on 
the "interactions" / simplifies, possibly in an essential way, the analysis. 
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In thc prccossion problem of [CG] 77 is thc dcviation from sphorical shapc of a planet 
precessing around its baricenter, which moves on an ellipse of (fixed) eccentricity e = iirj". 

(3) A general physical interpretation of (2.1) is that of a system consisting in (i) a forcing 
clock with angular vclocity 77° {i. e. a point moving on a circlc with angular vclocity ry", 
position a and action variable A); (ii) a pendulum (i. e. a point moving on a vertically 
placed circle with angular momentum / and position counted by taking (f = O as the 
unstable equilibrium position of tlic pendulum) ; (iii) a second forcing clock with angular 
velocity ri~^^^ and action variable B. Equivalently one can delete the Arj"', Br]~^^^ terms 
and replace a, Xhy ut = {r]°'t, r]~^/'^t) thus regarding the system as a time dependent one, 
consisting of a pendulum subject to a quasi periodic force with periods 2'Kri~"' . 
The "characteristic time" of the pendulum system \s To = . 

(4) Finally the "coupling constant" in (2.1) is written as /zry'^ and not just /x, because the 
convcrgcnce radius of thc cxpansions around thc unpcrturbcd torus is cxpcctcd to bo of 
the order of some power oirj: it would be nice to know the best value of c (thus replacing 
the constant c by its optimal value) but it seems not known, see [HMS], [ACKR] (it 
seems that c > i might be the right condition, but below we make no attempt at getting 
even close to such a small value). 

§3. Separatrices and non degeneracy. 

Supposing /z = O in (2.1) we look at thc unstable quasi periodic motion with A = Q, B = 
O, / = 0,<p = O where a = ao + r/'^f and A = Ao + rj^^/'^t. This is a family of motions 

def 

whose initial data are parameterized by a = (ao,Ao) and therefore form an invariant 

torus: we shall call it a hyperbolic torus. The energy of such motions is H = 0. If we 
fix A, B at other valucs wc find a continuum of invariant hypcbolic tori, with rotation 

vector LO (?y", ry"^/^). The energy of such motions is r]~^^^B + jfA. 
The torus is unstable and its unstable manifold W~ is 3-dimensional and contains the 
set parameterized by ((^, a, A) via the equations: 

^ = 0, / = /(<p), (a,A)e[0,27r]2, (pe(0,27r) (3.1) 

where /((/?) = /"^(v?) g\/2(\. — cosip) is the pendulum separatrix. But also the set 
with I = is part of the torus unstable manifold because of the meaning of 

as an angle. The stable manifold has equations I{lp) = ±l'^{(p): this means that 
W'^ = W~: a well known degeneracy of thc pendulum motion. 

A nice way of seeing the separatrix motions is by representing them in the (canonical) 
Jacobi 's coordinates: these are coordinates (p, q) in terms of which the neighborhood of 
the (two) lines / = ±/°((^) in (3.1) is represented, near I = ip = O, as: 

I = Ro{p,q), ip = So{p,q) (3.2) 

where thc functions i?0: Sq arc dcfincd ncar thc origin, where they vanish, and the pendu- 
lum motion becomes, in such coordinates, simply p{t) = Poe~^*, q{t) = goc^*- Therefore 
the {p,q) coordinates describe globally, as t — > +00, the evolution of initial data (^0,0) 
and globally, as t ^ —00, those with data {0,qo). In such coordinates thc cquation of 
the stable manifold is simply p = O while that of the unstable manifold is g = 0. 
The functions ^o, i?o have well known holomorphy properties: the latter imply that the 
singularitics of i?o(p'5^*: 0), -Ro(0, ge~^*), at fixcd p or q, occur a.t t = ±i^g~^, and thc 
same holds for So- This can be seen from the explicit solution of the pendulum equation 
in terms of elliptic integrals: an elementary analysis is in Appendix A9 of [CG], or in 
Appcndix Al of [G5]. 

The first result that we use to set up a general picture but strictly speaking not even 
necessary as shown in [Gel], is the (well known, see [Gr]) stability of such unperturbed 
hyperbolic torus and of its stable and unstable manifolds. This will be stated as: 
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Theorem 1: Suppose c in (2.1) large enough, and suppose that rj is such that the 

rotafAon vector {rj"' ,Tj~^/'^) verifies the diophantine property (2.2) and iiq > O is 

small enough. One can define functions T, S, A, divisible by ji and analytic in the 
variables {tjji,il)2,P,Q, /j), varying respectively on the torus [0, 27r]^, on a neighborhood of 
p = q = O, and in < /io with /io small enough, such that setting, as tjj varies on 

T2 = [0,2^]2; 

A^Aa + E{^p_,p,q), B = Ba + T{i^,p, q), a = -01, A = V2 
I =Ro{p,q) +A{jjj_,p,q), if = Soip,q) + Q{±,p,q) 

one defi/nes. for all A^, Bq, an invariant set on which the motion described by (2.1) takes 
place following t ^ {ip + uj_t,pe~'^* ,qe^'^*) (as long as the (j>,q) stay in the domain of 
definition of 'E.,T, A,Q), with g = (1 + ^{n))g and 7 analytic in /i, near fx = O and 
divisible by fi. The functions F, S evaluated at p = q = O have zero average with respect 
to tp ! o-lso the time average of H vanishes on the above motions and (therefore) H = 
for all of them. 

The radius of convergence fio is uniform inri <1, as long as o; verifies the diophantine 

condition. 

Remarks: (1) The thcorcm implics that if p = = O then the parametric equations 
ip = e(V^,0,0), a = ^1, A = V2, / = A(V^,0,0), A = S(;^,0,0) and B = r(V^,0,0) 
describe, as V varies on the 2-diniensional torus [0, 27r]^, an invariant torus. The quasi 
pcriodic rotation ijj ^ ip + tjt with lo = {rj"' ^rj'"^/'^) givcs, for all ^/j , a sohition to 
the equations of motion. This means that the invariant torus of dimension 2 that we 
considered in §2 (ie. ^ = B = 7 = O, <p = 0) survives the onset of perturbation and 
pcrsists, slightly deform,ed, with the samc rotation vector lu and a slightly varied pair of 
Lyapunov exponents {i. e. ±g = ±(1 + 7) 5 rather than ±5). The zero average property 
for r, S means that the perturbed torus is in the average (over tp ) located at the same 
position as the unpcrturbcd onc from which it "cmanatcs" : this is useful as it allows us 
to parameterize the invariant tori by their average position in action space. 

(2) Setting p = O, g 7^ O one finds a surface of dimension 3 which is a part of the unstable 
manifold of the torus, whilc setting g = O, p 7^ O onc gcts a part of the stablc manifod 
W+. Such manifolds are colorfuUy called local whiskers. They are in fact "local" parts 
of larger "global" manifolds (see item (4) below) and can be called, for this reason, local 
staMe and, local unstable manifolds. 

(3) The motion of A, B is computed from that of the other coordinates by quadrature 
{e. g. B = —ed\f{ip,a,X), while B itself does not occurr in the equations of the other 
coordinates, and similarly for A). The symmctry of / does not imply that A, O have zero 
average over ip , if (p = it, [CG], and also the Lyapunov exponent g in general changes 
by a quantity 7 of order 0(/x) with respect to the unperturbed value g{pq). The motions 
encrgy also changes, in general, by an amount analytic in fi and divisible by /j. {i. e. of 
0{„)). 

(4) Once the local parts of the torus whiskers have been defined as above we can extend 
them to global objects by simply appiying time evolution to their points. 

(5) The case /i = O (one of the most classical results in Mechanics) is a non trivial 
exercise: it is developed in Appendix A9 of [CG] or Appendix Al of [G5]. 

(6) The stable and imstablc manifolds do not coincidc. in general, for /i =^ 0. 

(7) in the anisochronous cases of §7 a very similar result holds with the difference that the 
relation between a and ip is also non trivial and described by a function A{iIj ,p,q) = 
(A( ,p, g), 0) with zero average over ijj and divisible by fi and with the same domain 
of definition of the other functions in (3.3). In this case all the functions in (3.3), and 
A as well, depend also on Ao which must be restricted so that o; = (»7" + r]Ao,r]~^^'^) 
verifies a diophantine property with suitable diophantine constants C {t]), t: the size of 
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the radius of convergence and the value of c will depend on the selected C{ri) (like (2.2) 
or (7.3)). 

(8) The above theorem is well known; it is explicitly proved (in a much more general case) 
in the abovc form in §5 of [CG], scc p. 38. Its proof is a rathcr straightforward adaptation 
of Arnold's method of proof of the KAM theorem, [Al], as exposed for instance in [Gl]: 
the only element of "novelty" is perhaps the normal form of the motion in the coordinates 
ip ,p,q, as long as the latter two rcmain in thcir domain of dcfinition. A more modern 
proof, based on Ehasson's method, [E], for proving KAM (as exposed in [G2],[GG]) and 
extending it to the problem of tori of one dimension less than the maximal, can be derived 
from [Gel] (where only the cascs p = O or g = O arc studicd). Also unusual is the absence 
of the twist condition (present, and necessary, in the more general proof in [CG]): it can 
be eliminated because of the special structure of the hamiltonian (2.1). But it is a long 
and unintercsting proof for us hcrc. Wc shall not nccd, howcvcr, the normal form and, 
as it will be clear, we only need the classical results in the weaker form discussed in 
[Gr],[Gel]. 

§4. Splitting angles. A recursive determination. 

The symmetry of / implies an intersection a± (p = it and a = O (see below or, for 
instance, p. 363 of [G3]) between stable and unstable manifolds of the torus into which 
the unpcrturbcd torus {i. e. A = B = O, I = O, f = 0) is dcformcd by the pcrturbation. 
Therefore we set up an algorithm to study such intersection. For this purpose it is 
convenient to work in the original canonical coordinates and write the stable and unstable 
whiskers Wj^ as: 

= {{^, a, I, A) = (^, a,I^{v>, a), A±(^, a))} (4.1) 

with a eT^,£< \ip\ <27r — e where e > O can be fixed a priori as small as we please 
providcd we diminish the value of the analyticity radius /io in theorem 1. 
In other words the whiskers deformation is of O {p,) in every closed subinterval of 
(— 27r, 27r): therefore they remain parametrizable by (p, a for (p in any closed subin- 
terval if fi is small enough (just note that for /x = O they are parametrizable). We say 
that, in such region of {(p, a), they are graphs over the angle variables. We only need 
that p = TV is allowed (hcncc s — ^ will do). 

We define the splitting vector Q (a ), the splitting or intersection matrix a,nd the splitting 
between W+ and W~ a.t (p = n and a = O , respectively, as: 

Q(a) =^ A + (7r, a) - A-(7r, a), D'^=^ Q{a)\ anddetL» (4.2) 

A relevant remark (Lochak and Sauzin, private communication) is that the whiskers are 
lagrangian manifolds so that, for a suitablc gcnerating function S^{p, a), the whiskers 
have equation: A"^ = daS{p, a),I^ = d^S^{p, a) around every point {ipQ, a^) where 
they can bc locally rcgarded as graphs over the angles {ip, a), e. g. at p = 'n and for ji 
small. This implies that D is symmetric, as we indeed find in (6.10), (7.10). 
We now derive recursive formulae for I^, in (4.1) and their time evolution, keeping 
in mind that for /i = O it is = = I^{ip) = g\/2{l — cos p), A"^ ^ Q. The unper- 
turbed motion is simply: X°{t) = {p°{t), a + wt, I°{ip°{t), 0), where {ip° {t) , 1° {p° {t))) 
is the free {i.e. with /i = 0) separatrix motion, generated by the pendulum in (2.1) set- 
ting the origin of time whcn the pendulum swings through p = n. The foUowing §4, §5 
are presented here only for completeness: although selfcontained they are not meant as 
a substitute of the work done in [G3] but serve the purpose of guiding the reader to dig 
out of that papcr what hc may want to see in more detail. 

Let X^{t; a ), a = ±, he the evolution of the point on Wj^ whose initial coordinates are 
given by (tt, a,/^(7r, a), A'^{'jt, a)); from now on we shall fix initial data with (p = tt 
(which amounts to studying the whiskers at the "Poincare's section" {(p = tt}). 



5/ febbraio/2008; 18:17 



6 



The analyticity in /i implicd by thc abovc thcorcm 1 and tlic analyticity propcrtics of 
the Jacobi functions Ro,So allow us to consider the convergent (if c in (2.1) theorem 1 
is large) Taylor series expansions, in /z, of the whiskers eguations. Let: 



X;(t)^X;(t;a)t/'^X'=-(t;a)£'=^ ^X'=-(^)£^ a = ± (4.3) 



'"'E- 

k>0 k>0 



be the power series in e = firj'^ of (convergent for /x smah); note that = X° is the 
degenerate unperturbed whisker. We shall often omit writing expUcitly the a variable 

among the arguments of various a dependent functions, to simphfy the notations. 
Theorem 1, §3, tells us that the t-depcndence of X^{t) has the form: 

X;(t) =X;(a;t,i;a) (4.4) 

where X^{'il) ,t; a) is a real analytic function, of all its arguments {p, included), which is 
periodic in ip and a . And in fact in our isochronous case the dependence on ip and a 
is via a + ip ("no phase shift" in the sense of [CG]). The two functions X'^'^{t) will be 
regarded as forming a single function X'^{t): 

^'W = (S-S 1!'^ = ^^^"! = + (4.5) 
\X'^ (t) if(T = signi = - ^ ' 

We label the 6 com.ponents of X with an indexj, j = 0,1, ... ,5, and write them fnotation 
used in [G3] j, with the convention: 

and we write first the angle variables ({ip,a,X) = {X-,Xi)), then the action variables 
({I, A, B) = (X+ , X I ) j; first the pendulum, then the rotator and then the clock variables. 
Therefore at order O in /i: 

Xo(t) =(^°(t), X,{t) = oo + rft, X2{t)=\^+r^/H 
X^{t) =l\^\t)), X^ = O^X,, 

where (p'^(t) is the frcc separatrix motion (i. e. (f>'^(t) = 4arctge~^*, see Appendix Al). 
Inserting (4.3) in Hamilton's equations for (2.1) and comparing the various orders in /x, 
the coefHcients X'''^{t) = X'''^{(jt,t; a) are seen to satisfy the hierarchy of equations: 

^X'"' = = LX^'' + F^" (4.8) 

where L is a 6 x 6 matrix with only two non vanishing clcmcnts L03 = 1 and L30 = 
g"^ cosip^{t). Expanding X'' in powers of e and imposing that the equations of motion 
are verified, we find recursively the expressions for F. For instance: 

Fl=-d^f, F] = ~d^f 

Fl=-^- sin^ (Xi)2 - d^2fXl. - d^^fX\ 

F]=-d^^fX\-d^.fX\ (4.9) 

2 2 

Fl=-^ 2X\Xl sin^ - |j cos<^(Xl)=^ - d^.f Xl+ 

F I = - da(pfX'i - da^f KI - -^^a^f " d^a^f - -d aip^ f X\Xl_ 
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where the functions are evaluated at ip{t) = f^{t), a (t) a + uj_t. Morc generally, F'' 
depend upon X°, X''-'^ but not on It is = F J s O for all k>l. 

Expressing the solution of a linear inhomogeneous equation like the one in (4.8) can be 
conveniently done in terms of the wronskian matrix. We recall therefore the notion of 
wronskian matrix W {t) of a solution t ^ x{t) of a differential equation x = f{x) in i?". 

It is a n X n matrix whose columns are formed by n linearly independent solutions of the 
linear differential equation obtained by linearizing / around the solution x and assuming 
W [O) = identity. In other words W {t) is, in our case, the solution of the differential 
equation W{t) = L(t) W(t), W{0) = 1. 

The solubility by clomentary quadrature of the free pendulum equations leads, on the 
separatrix, to the foUowing formulae that have importance bccause the wronskian of the 
free separatrix motion can be expressed in terms of them. If (p" {t) = 4arctane~^*: 

COS(p° =1 



n „ sinh gt , 1 — a;^ 

""'^ (^^^ = (TT^ 

for X = e^* or, as well, for x = e~f*. Eq. (4.10) leads, see (Al.l) in Appendix Al, to the 

foUowing cxprcssion for the wronskian W{t) of the separatrix motion for the pendulum 
appearing in (2.1), with initial data at t = O given hy (p = tt, I = 2g: 

rrr,.-, ( ^SilT^ f \ _ _ 2gt + svoh^gt 



cosn 



Notationally we follow here [G3] (in [CG] I, <p are exchanged). The evolution of the X± 
{i. e. I, ip) components can be determined by using the above wronskian: 

jE)=m^)(^.2(0))+^W/V-^(r)(^,.%) dr (4.12) 

Thus, denoting by Wij {i, j = 0,3) the entries of W, we see immediately that: 

X^-(t) =wos{t)Xl''{0) + wosit) /o u;oo (T)F^-(T)dT - woo{t) J* wo^{t)FI'' {t) dr 

{t) =«;33(t)Xf (0) + wzz{t) Si Woo{T)Fl-{T)dT - Wsoit) /o wo3{t)FX-{t) dT 

(4.13) 

Integrating on the separatrix (4.8) for the t, J, components is "easier": one can find 
it directly or, more systematically, by writing the full 6 x 6-wronskian matrix of the 
equation (4.8), which is trivially related to W {t) above; we shall not write it here: see 
eq. (4.21) in [G3] for an explicit expression. The result is: 

Xt^(i)=0, Zf^(t) = Xf (0)+ / £|"(T)(iT (4.14) 

Jo 

having used that the 1*^(0) = O , because the initial datum is fixed and /j--independent. 
If (2.1) is modified by adding an isochrony breaking term the first component of 
the first of (4.14) becomes: 

X^-{t) = r, (tX^(0) + f\t- t)F^-{t) dr) (4.15) 
Jo 

where X^'^ is the first compnent of X_Y , see eq. (2.18) in [G3]. 
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Equations (4.13), (4.14) can bc uscd to find a rcasonably simplc algorithm to rcprcsent 
whiskers to all orders A: > 1 of the perturbation expansion. It is very important to 
keep in mind that the initial data in (4.13), (4.14) are not constants: according to the 
convcntion following (4.3) thcy can dcpcnd on the a variables of the initial data. This 
means that the functions X depend separately on a and Lot. Except when, as in (2.1), 
the hamiltonian is linear in the A, B variables. In the latter case the dependence on 
a and cot of the r. h. s. of (4.4) (where the notation is complete and all variables are 
indicated explicitly) must be through a + Lot, since the a angles vary as a = w . 

Note that the case (2.1) is non trivial and, in fact, very interesting: it is eguivalent to 
a problem on a non linear quasi periodic Schrodinger eguation, sce [G2], [BGGM]. The 
extension to anisochronus cases {i. e. with a quadratic term in A added to (2.1)) is worked 
out in [CG] up to third order: a general analysis can be found in [G3]. 

The initial data (still unknown) in (4. 13), (4. 14) are detcrmincd by imposing the correct 
behavior at ±oo, and the correct dependence on a and ojt {i. e. a dependence on these 
two vectors through their sum). These conditions arise from the fact that the motion 
must be asymptotic to the quasi periodic motion on the invariant torus whose whiskers 
are described by X+ ot The scheme to do so is the following, see [G3]. 

Note that Wo3, «^33 in (4.11) behave, ast ^ aoo with a = ±1, asymptotically as ae^*'^ /4 
and —e^^" /A, while the other two matrix elements become exponcntially small. Hence 
we see that the tems in (4.13) proportional to w^zit) or Wzoit) diverge, in general, as 
t aoo exponentially fast (supposing the integrals convergent, i.e. supposing F!^'^{t) not 
growing faster than a polynomial in t, as it will turn out to be). But they are multiplied 
by: (X^'^(0) + /o Woo{t)FI'' (t) dr) so that the condition to determine X^'^(0) is: 

/><70C 

Xf (0) + / wooiT)F'mT) dT = (4.16) 

Likewise, in the isochronous case, from the second of (4.14) we determine ^^^'^(0) by 
imposing that ^^'^(i), i.e. the momentum component corresponding to the isochronous 
angles a, A, depends asymptotically on t only via a + ut: this will determine X ^'^(0) 
up to a constant. 

And the average over a of F^'^ (t) must tend to O as t — > croo (otherwise the second of 
(4.14) could not possibly be bounded as t aoo: but it has to be such because X | "'(<:) 
has to be bounded, by theorem 1). This means that the constant is not a function of a 
and can be fixed arbitrarily: however we want that the averages of the S, F in (3.3) are 
O so that X'"^{0) is completely determined. One finds, for both components: 

/•(TOO 

Xf^(0)+ / Zf('r)dr=0 (4.17) 
Jo 

where the integral is usually improper: see (5.1) below for a proper definition (derived 
by simply looking at the meaning of the conditions imposed to determine ^1*^(0)). Eq. 
(4.17) remains the same even in the anisochronous cases (see eq. (4.5) in [G3]). 
The key to concrcte calculations is that, / in (2.1) being a trigonometric polynomial, 
the function F^ (see (4.9)) belongs to the class A4 of linear combinations of terms like: 

M{t) = ii^x''a'^e'P^-i^^ (4.18) 

with h > 0,k integers, = 0, 1, p = ±1 and gk ±iLU ■ v_ O and we set x = e"^*'^ with 
a = sign t = ±1. In fact F^ is a finite linear combination of harmonics u. By (4.10) we 
see that (4.11) is an analytic function of the variable x = e^* and of t, ov of x = e~^* and 
of t: the explicit f-dependence arises because of the term uJ in (4.11). 
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Expressing as a sum of monomials likc (4.18) rcquircs a (convcrgcnt) infinitc suni 
over k, h. In the case of F^, in fact, there are no monomials with a power of t higher 
than O {i. e. with h > O 'm (4.18)). By induction all the F'' have the property of being 
expressible as sums of monomials like (4.18). Howcvcr, starting with the sccond order, 
one sees that powers of t do appear (this reflects, see theorem 1, that the function 7, 
describing the change of the Lyapunov exponents to ±(1+7)5 with 7 analytic in fj,, is not 
identically zero; see, however, [Ge2]). A fuU description of the induction can be found in 
[G3]. 

For the analyticity properties of the series introduced above we refer to §2 of [G3] and 
we proceed to a quick discussion of the determination of the the initial constants. 

§5. Improper integrals and the operators X, 0,00- 

The integrations in (4. 13), (4. 14) can be expressed in terms of an operator T acting 
hnearly on finite linear combinations of monomials like (4.18) with k'^ + {uj_ ■ u)^ > 0: 



lM{t)'^^ I M{T)dT, with: 

J aoo 



where the first row is a formal dcfinition whose mathematical meaning is given by the 
second row (note that if A; < O the first line is an improper integral), and we set g — 1. 

Note that the X is not defined on the polynomials of t, a, i. e. if A; = O and lu ■ i;_ = O 
(so that no exponentials are prcscnt in the monomial defining M). It can be naturally 
extended, for j > O, to the polynomials by setting It^a^ = j^V^^ , see (3.7) in [G3]. 

The T is an integration with respect to t with special initial data: in fact at fixed a: 

dtIM = M (5.2) 

If M is such that M (t) = M{u)_t, a) for some M( , a) defined on the torus, then: 

IM{t) = {oj- d^)-'^M{ojt,a) (5.3) 

The integrals in (4. 13), (4. 14) can be expressed in tcrms of the operators: 

OFit) =wo3{t){l{wooF)))it) - waa{t){liwo3F)){T)\l^ 
0+F{t) =W33{t){liwoaF))it) - W3oit){lwo3)F){T)\l. (5.4) 
jV(t) =l'^F{t) -X^F{0'') 

where a = sign t. Then one finds, in the general anisochronous case: 

X'l{t) = OF'^{t), Xl{t) = 0+F'^, X\{t) = r]EfFJ^{t), (t) = I F 'l {t) (5.5) 

where E is the projection over the first component, see (4.15), and F'^ have to be ex- 
prcssed in terms of the X^ with h' < h. 

Since, in this section, we consider an isochronous model the above formulae are slightly 
simpler as X^(i) = O, i.e. must take E = O in. the third of (5.5). In §7 we shall need, 
however, (5.5). 

Thcrc is no difhculty in setting up a general recursive scheme for the computation. We 
just give the result using the convenient notations {kj > l,mj > 0): 

(kj) m,p = {ki, . . . ,k':^^,ki, . . . ,k^^, . . . ,kf, . . . ,kf^^) s. t. ^^fc]=p (5-6) 
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referring to §2 of [G3] for more details (if needed). If /i ^= /, /o ^= cos (p^ we find: 

^ / — 1 mi 

F^_^^Q, Ff^G, fy=- e E nn^'^^ 

* - l — l rui 

E -tc^^^v^/^) E (^o/.)n. E nn^?" (5.7) 



|m|>2,5=0,l — |m|>0 (fcpm »=0 J=l 



where {kj)m,k, {kj)m,k-i are defined in (5.6). The * means that if 5 = O only vectors 
m with I TO I > 2 havc to bc considcrcd in tlic sums. Notc that if S = O thc sum in 
the expression for F^ can only involve vectors m with m j = O if j > 1, because the 
function /o = g'^cosif depends only on tp and not on a, (hence also k j = O if i > 0). 

The functions are evaluated at (p{t) = y^^{t), a{t)'^= a + w t. The indices in (5.7) are 

mutually contracted with a natural rule that we leave to the reader to work out. 
The relations F ^ = O and f ^ = O are general (as the equations for X ^ , X- are linear) 
and in the isochronous cases: 



x\{t) = - j(a„/), x^t) = -x(d^^fo{-d^fy^ 

X]{t) = -l(d^^fO{ -^sm^O{d^f)0{d^f)))+ (5.8) 

(see the examples in (4.9)). We fix our attention on the models with / a trigonometric 
polynomial ("trigonometric perturbation" ) of degree N: 

f{a,ip) = fs{ip,a,X) + fF{<p,a,X) 

^ (5 9) 

fj{ip, a, A) = E fj,{n,u) C0S(!/1Q! + Z/2A + IKp), j = S, F 

with |n|, 1 I < and f s. (n. u) = O unlcss 1^2 — O, i. e. £ is a slow mode, whilc f f. (n. 1/ ) = O 
unless £2 7^ 0> £ is a fast mode. We also say that f s depends only on slowly rotating 
angles and /f depends on fastly rotating angles. A nontrivial example can be: 

/(a, A) = ( cos(a + <f) + cos(A + t^)) (5.10) 

The intersection matrix to order h, D^-, can be expressed, to order h= 1, 2, 3 as: 

/00 /"OO 
dt dijf, Di = - dt [dijof O{dof) + djofO{diof)] 
-00 J —00 



£»22 = / dt 



W30 O{d220f) O{doff + 2W30 O{d20.ff 0(9o/)+ (5.11) 



+ d200fO{d20f) O{dof) + doofO{d22of) O{dof) + 

+ doof O{d20f) 0(820 f) + ^^2200/ O{doff + d200fO{d20f) O{dof) 



where the derivatives of the /'s are evaluated at ip{t), a + uj_t, with (f (t) = (p°{t) the 

free separatrix motion, see (4.10). We set do '== and di '''= do^ ; the a 's have to be set 
equal to O after evaluating derivatives. The expressions contain improperly convergent 
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intcgrals (in general) and must be understood by thinking as + J(^°° and by 

using the definition (5.1), see [G3]. 

It is convenient to split the operation O, see (6.5) in [G3], as: 
0{F) =Oo{F) + \wo3{t)\ G(o)(i^) + wooit) G^^\F) 

Co(i^)=^(y" dT{wo3{t)woo{T) -woo{t)wo3iT))F{T) + j dr {same)j (^^12) 
G(0)(F)=-/ dTWoo{r)F{r), G^'\F) ^ - dr \wo3{t)\ F{r) 

^ J+oo ^ -Z+oo 

The identity, see (6.12) and Appendix A2 of [G3]: 

/+00 /.+00 
dtF{t)0{H){t)= I dtH{t)0{F){t) (5.13) 
-00 J — CO 

implies symmetry of the above matrices Dij, at least for the first three orders (see (5.11)): 
symmetry follows to all orders as said after (4.2), or as it will be seen in §6. 
In the anisochronous case, i.e. in §7, we shall also use the sphtting: 

I^{F)=ll{F) + \t\G^'^\F) + G^^\F) 

^oiF) = \{J_ dT{t-T)F{T) + drisame)) ^^-^^^ 

1 ^-00 1 (.-oo 

G(2) iF) = - dT F{r), G(3) iF) = - dr |r| F{t) 

^ J+oo ^ J+oo 

see eq. (6. 3), (6. 6) in [G3]. 

The key remark, to understand the asymptotic behaviour of (5.11) as r; ^ O, is that 
whenever the integrand is analytic it becomes possible to shift the integrations, over t 
and the r's, to an axis close to Imt, Imr = ±(^ — rj^^^): see §8 in [G3] (choosing the 
free parameter S appearing in [G3] as r?^/^). 

Using that G(i) (F) = G^"' {F) = O if F is odd (as the odd derivatives of / are, when 
evaluated at a = 0) and using also that Oo leaves parity unchanged, in general, we 
shall find that the non analytic terms {i.e. those containing integrals of a non analytic 
function, e. g. \wo3(t)\) cancel each other'm their contribution to the determinant of Dij 
to all orders k > 1. The result will be the proof of the foUowing theorem: 

Theorem 2 ("Large angles theorem"): Consider a system described by the hamiltonian 
(2.1) or (7.1) with f an even trigonometric polynomial. Let n be small (|/^| < no) and 
c large, enough. Consider an invariant torus with diophantine rotaMon lo among those 
described in theorem 1 above. At the homoclinic point with (p = tt, a = O the intersection 
matriz determinant is ezponentially small asrj ^ O and it is generically asymptotic to its 
first order value ("Melnikov' formula"). The choice given by (5.10) is a concrete example 
of this result, generically Holding for (5.9). 

Remark: The result in the casc / is given by (5.10) is detD = 247r?7~^/^e^e^5'' ^''^ in 
both cases (isochronous, (2.1), and anisochronous, (7.1)) with a > 0. This theorem has 
an analogue with and a = i.e. only two time scales: an early review is in [G3]. In the 
latter case it has been extended to analytic perturbations, i.e. bcyond the trigonometric 
case, and to cover the exact asymptotic value of the splitting, i.e. far beyond [G3], see 
[DGJS], [RW]. The above three scales case, (2.1), is quite different from the two scales 
case (discussed in [G3]): but arises naturally in celestial mechanics problems near a 
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double rcsonancc, as in tlic' casc; of tlic' precession problem in [CG] to which we hope to 
apply, eventually, the results of this paper. 

The case in (2.1) is that of a pair of "clocks" and a pendulum. The oase of a "clock", a 

"rotator" and a pendulum is exemplified by the hamiltonian obtaincd by adding ^rjA^ 
to (2.1), scc (7.1) bclow. Both cases are treated in full detail in §6, §7. 

§6. Nonperturbative splitting analysis in presence of fast and slow rotations. 

It would bc casy to show that the dctcrminant of the intcrsection matrix is exponcntially 
small to order e^: this requires evaluating the intersection matrix only to third order. But 
it cannot be done without due care, as the error in [CG] was precisely due to the belief 
that it was not ncccssary to evahiatc the matrix clcmcnt D22 because it was cxponentially 
small. In fact it is not exponentially small and it has the right value to make, instead, 
the whole determinant exponentially small. 

The real problem is to compute the determinant to all ordcrs and to show that to all 
orders it is exponentially small: i. e. to all orders the determinant is a sum of "large" terms 
(not exponentially small) which "cancel each other" with a result that is exponentially 
small. So small that the first order calculation dominates in the limit as 77 ^ 0. 

It is remarkable that in fact one can give an exact expression for the leading corrections 
to the first order of perturbation expansion. See (6. 12), (7. 19). 

This scction relics, and in fact it follows almost immcdiatcly, from the general thcory of 
the intersection matrix in [G3]. We cannot repeat here the general theory and therefore 
refer the reader to [G3] for details on the main definitions: we try nevertheless to make 
what follows rcadablc at least from a formal vicwpoint and as a guide to [G3] . The point 
of [G3] is that the intersection matrix can be quite explicitly calculated to all orders by 
using a graphical formalism very similar to that used in quantum field theory when the 
Schwingcr functions are cxprcsscd via Fcynman's graphs. 

In the present case the graphs will be, topologically, trees: very unusual graphs from 
the viewpoint of field theory, where loops are often the main source of interest and non 
triviality. On the other hand the graphs havc nodcs with arbitrarily large coordination 
number: also unusual in quantum field theories (with polynomial interactions) . 

Let ^? be a tree built with oriented lines all pointing towards a "highest" node r that we 
call the root and that we suppose to have only one "incoming" line, see figure below. 

def 

The graph will bear a label Sy = 0,1 on each node: if 5^ = 1 it represents / = /1 while 
if ^„ = O it represents foif) = 9^ cos (f. And the labels can be given arbitrarily with the 
restrictions that all endnodes bear a label 6=1 and that all nodes bearing a label ^ = O 
have at least two incoming lines. Each node will also bear a "time" label r^. 

We define the value of a graph by building the following symbol. We first lay down a 
set of parenthcscs () ordcrcd hicrarchically and rcproducing the trcc structure: in fact 
any tree partially ordered towards the root can be represented as a set of matching 
parentheses corresponding to the tree nodes. Matching parentheses corresponding to a 
node v will bc madc casy to see by appending to them a label v. The root will not be 
associated with a parenthesis. 

Inside the parenthesis („ and next to it we write —dg'^^^^fs^ for all nodes v lower than 
the node vo preceding the root ("first node"). For v = vo we write —djdo" fs^^ where 

def def 

do = and d j = 9«^ , j = 1,2 (this implies that 5^^ = 1): the functions have to be 
evaluated at (¥'°(t), a + Lot). 
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Vn 

A graph S with pvg = 2,pvi = 2,pv2 = 3,pv3 = 'i,PVi = 2 and fe = 12, J^pti! = 2* ■ 6, and some 
decorations. The line numbers, distinguishing the lines, and their orientation pointing at the root, are 
not shown. The Unes length should be the same but it is drawn of arbitrary size. The nodes labels Sv 
are indicated only for two nodes. 

OTitsidc the parenthesis wc writo O for all the v < vq and wc add to the right of 
the matehing parenthesis the symbol (t„); for the first node we simply integrate over Ty^ 
from +00 to — oo. 

The symbol thus defined has the meaning of a Unear combination of products of multiple 
integrals if one uses the definitions of the symbols O, see (5.12). We multiply it by 
n!~^ if n is the number of hnes in the graph and we shall regard all the lines different 
(i. e. labeled); however two graphs that can be superposed, labels included, by successively 
rotating rigidly around the nodes subtrees that are attached to them have to regarded 
as identical. This defines the value of a graph (it is a function of a). The reader can see 
that the above is a rathcr natural construction by working out patiently the definition 
in the case of trees with one, two or three nodes. 

The sum over all graphs of "order" k = J2v °f graph values gives the coefBcient 

of order k of the spiitting vector Q (a): scc [G3] whcre the above construction is 
perfornicd in Fouricr transforni to obtain dircctly an cxprcssion for Q ^ . 

It is convenicnt to makc this morc cxplicit by using the dccomposition of O in the first 
line of (5.12). This can be easily done by simply attaching to each node lower than 
the first a label /3 = O, D, R signifying that wc select the first of the three terms in 
the dccomposition of O (see first line in (5.12)) or the second or the third. We can 
alternatively imagine drawing a circle around each node v enclosing only the subtree 
with that node v as first node and then to attach the label to the circle. 

By (5.12) we see that a circle with a D or R label encircling a node w linked to the higher 
node v (external to the circle) rcprcscnts just a function |wo3('''t;)| or Woo{tv) multiplied by 
a number that in order to bc cvaluated requires essentially the same operations required 
to evaluate the value of a graph 'd. 

This allows us to give a nonperturbative expression for the spiitting vector Q (a). We 

simply consider the sum of all the values of the graphs bearing a label O on all the nodes 
except perhaps the endnodes that can bear also D, R labels. We evaluate the graphs 
values and in the end we replace the number associated with the R, f-labeled endnodes 
by the full pert,urbation series that is obtained by imagining that inside the circles with 
R and D labels there is the most general graph with O, R, D labels in all possible ways. 

The sum of such perturbation series will be denoted G^^^a) for D-labeled circles, 
and G'^^-'(a) for i?-labeled circles. The new rcprcscntation of Q{a) is therefore a 
representation in terms of trccs with a few ^^fruits^^ around some of the endnodes (possibly 
none or all) that can be D labeled or ii-labeled {dry or ripe, to foUow the names of [G3]). 

Furthermore G^^^ and G^°) verify a simple recursion relation that can be found by a 
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morc cxplicit roprcscntation of tho quantitics dcfincd in thc same way as G^^\G^^\ Q 
hut evaluated by considering only trees deprived of fruits, see (6.4) below. 
Let 1? be a graph whose nodes v carry indices (5„ = 0, 1 and t„; let vo be the first node 

of 1? and v' > v hc (if v < vq) thc nodc following v. 

Fixed 1? and setting w{t' , r) = wo3{t')'Woo{t)—'Wo3{t)'Woo{t') define the function A{{Ty}) 
of the time labels Ty associated with the nodes by: 

^= W( n Mrv',ry)s'^dE^+'fsM)d'o^''fK,{r.,) (6.2) 

V<Vo 

where p„ is the number of lines entering the node v; /i (r) = fs{(p{t), a + uj_t) and n is 
the number of nodes of the graph i?; d^fsir) means {d^fs){(f°{t), a + uj_t). 
Wc define also A^^^y^^^^ by the same cxpression as A with cxtra derivativcs di, dj, d„i, . . . 
acting on the function fs^ , fs^ , fs^ , ■ • ■ in (6.2) . We shall say that i,j,m,... are marks 
on the nodes x,y,w, . . .. In terms of graphs we simply represent ^^f^%','.'.'. by affixing also 
"marks" i,j,m,... to the nodes x,y,w, Then let, for j = 1, 2: 

r(^H«) = E^/K3(r.JK(N}), _ 

r("H«) = E^/^oo(r.„X({r4), r,(a) = 2E^/<({T4) ^ ■ ' 

the integrals ranging over Tvq € (+oo, — oo) and over Tv G (— oo, r,,') U (oo, r„') for v < vo- 
Given the above definitions, G = (G(°),G(i)) and T = (r(o),r(i)) verify the following 
(analogues of the well known field theoretic) "Dyson equations" (see also [GGM]): 



GW(a) =r(0)(a)+5^ [ AO^>oo(t.o) (ko3(r,)|GW(a) + t«oo(T,)GW(a)) + .. 

GW(a) = rW(a) +^ / <%K3|(t.o) (\wosiTy)\G(°\a) + t«oo(r,)GW(a )) + .. 

(6.4) 

where the r. h. s. represents the contributions of the trees with no fruit or with just 
one fruit and the .. denote contributions from the trees with more than just one fruit. 
Equation (6.4) is obvious if one writes the r.h.s. as a sum of graphs and sees its graphical 
interpretation: note that (6.4) is a "non perturbative" identity since F , G are infinite 
series in e {i. e. they contain all the contributions from graphs of any order). Then: 



Q,=r,+2^ / ^ AifJ\wo3iT^)\G^''\a)+woo{T,,)G^'\a))+... (6.5) 

with the above meaning of the ... {i. e. up to sums involving at least two fruits). 

The convergence of all the above series and the estimates about their remainders are 
nontrivial and an open problem: they are, however, finite order by order as it foUows 
from thc analysis of Appendix Al of [G3]. Therefore what foUows can be interpreted as 
a collection of identities valid order by order in the expansion and as a way of deriveing 
and expressing compactly infinitely many identities. This will be sufficient for all our 
conclusions; nevertheless one would gct bcttcr bounds if one could prove convergence. 

Consider ^ , vf^ , \ rf \ Tij defined as the di derivatives of G(°) ,T^°\G^'^\r^'^\Tj, 
see (6.3), (6.4), evaluated at a = 0. The terms hidden in the . . . contribute O since at 
the homoclinic point by parity G^") = G(°)(0) = O and, as well, G^^) = G^^^O) = 0. 
Differentiating (6.4) and cvaluating at a = O wc gct the remarkable exact rclations: 

G f = rf ) + Moi G f + Mn G« , G« = r« + Moo Gf ^ + M,o Gf^ (6.6) 
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with the matrix M = ( ^^"^ I defined by: 

\^ Mio Mil J ^ 

f J2i>-yei> Ko,v\'^03{Tvo)\\W03{Ty)\ T,i>-yei> ^vo,yW00{Tvo)\W03{Ty)\\ ^g^^ 

v T,^;ye« '^vo,y\^03{Tvo)\wooiTy) T,«.y^^ J~°° A°'°yWoo{Tvo) WOo{Ty) J 

M is symmetric: this is implied by the symmetry of the operator Oq, which follows im- 
mediately from that of O (sce (5.13), and Appendix A2 of [G3]: or proceed inductively) . 
More generally the symmetry of O^, repeatedly apphed (as done with O in [G3]), im- 
plies that E^.j, / A^vL^iT,„)HiTy) = E^,^ / A'^'^%H{n.„)F{Ty) for all F, H: used several 
times below. 

Going back to (6.5), in order to evaluate the derivatives diQj{a), i, j = 1,2 a.t a = O 
we need only the Hnear terms and the . . . contribute exactly 0. Therefore, if we set: 

r,, =2 ^ / A^i^y (6.8) 

and note that G = (1 — aM)~^ F with cr = ^ ^ ^ ^ , then the intersection matrix D is: 

Dij = Tij + 2{vfGf^ + Tfof^) (6.9) 
which, setting C = a{l — aM)~^, becomes: 

/rii + 2(r„cri) T,2 + 2{r^,cr,)\ 
^- Vr2i + 2(ri,cr2) T22 + 2{T^,cr^) J ^''•'"^ 

The matrix C is symmetric, because M is symmetric; hence D is a symmetric ma- 
trix (to all orders as (6.9) is exact): this agrccs with the mentioned result (see com- 
mcnt foUowing (4.2)) of Lochak and Sauzin on the symmetry of D. Noting that r2'''* = 
gnp(o)(n)^ _ g^p^^^^ .y^rith an explicit calculation yielding in the case (5.10): 

r(0)(i) = i^-V2r« = 2nn-'e-^^-''\l + 0{e-^-^-''')) (6.11) 

An elementary (and somewhat surprising) calculation of the determinant yields a re- 
markably simplc cxprcssion in which the first non trivial order of det D [i. e. the second) 
and the possibly non exponentially small contributions are evaluated without any ap- 
proxim.ation: 

det D = rli¥^2^ - 2 (riiMii - 2rf ^ V2^^^ dct c + . . . (6.12) 

the last . . . coUects contributions to det D from the products in det D involving ri2, 
(for instance — 4ri2( F i, CFj) or 2r22(r i, CF i), — F12 and a few others) or the higher 
orders of FiiF22. 

The terms not written in (6.12) contain as factors (see Appendix A2 below) derivatives 
82 ("fast angle derivatives") of integrals of an analytic function of the r^'s: if we imagine 
developing the kcrncls A as Fouricr scrics in a wc scc that only Fouricr components 
= (1^1,2^2) with J/2 7^ O can contribute. This means that the dependence on the r„ 
will be via the matrix elements w{tv',Tv) of the pendulum wronskian matrix and via 
e'S,,^"- -" with J2v ^t;2 7^ O that "the dependence on time is via a total fast 

rotation" ) . 

So that by shifting the integration over r„ to ~ ±z(7r/2 — ry^/^), i. e. almost as far 
cut in the complex as the singularities of the lu-functions allow, one sees that such 

TT — 1/2 

terms contribute a quantity bounded to all orders proportionally to e~2'' because 
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|a; ■ i^l > vi^^l"^ — Nhij^f^ if z/2 ^ 0: this givcs a good bound for h not too large, 
e. g. h < ?7~^^ (actually even for h ~ ?7~^), see Appendix A2 for details. 
For h > ri~^/'^ one can invoke the convergence of the series for detD and obtain a 
bound {rj~'^'^eY (much smaller than e"^'' because e = iirj'^, provided c is large 
enough). The conclusion is that the terms not written in (6.12) can be bounded by 
^3g-f?j 0{r]~^^), provided c is large enough so that the sum of the bounds of the 
orders £rom 3 to ri~^^^ is dominated by the third order bound: see Appendix A2 for 
details. 

The value of /3 can be takcn 2(A^ + 1) + Ad 'm terms of the order of the dcgrcc N 
of / and of the diophantine constant d in (2.2), it is explained by the singularities of 
the elementarily computable Fourier transforms of cos Nifi{t) and sin N(p{t), see [GR]. A 
similar argument is in §8 of [G3]. 

The above argument is good but it gives a rather poor bound: see Appendix A2 for 

some improvements. 

Hence the leading value of dct D is given, as rj ^ O, by its first order expression t''iiT^2 
plus the apparently much larger — 2(rii Afn — 2r^"-'^) v''^'''^ det C. But we shall show that: 

(-rnMn+2ri°'V2'^'detC = 0(?7-^'^)e^e-^''"'^' (6.13) 

because of the first factor bcing of 0{e^ri~'^^ e~^^ ^^^), i. e. essentially O, again assuming 
(temporarily) convergence of the series for G , F , F, M. 

The discussion of the convergence for the series for F, F , M is very non trivial, while one 
could show the convergence of the series for G, foUowing [Gel]. However the series for 
det D converges and its convergence, which is absolutely essential, foUows immediately 
from Theorem 1 above. 

It is remarkablc that wc can avoid proving the convergence of the power series for 
F, F, M, G and get away with only the easily established convergence of det D: in fact 
we can just use the above series as formal power series and that is all we really need 

{together with the convergence of det D). 

After all the identity (6.12) as an identity between formal power series and the formal 
bound (6.13) show that to all orders the det D is exponentially bounded and this is almost 

enough. In fact up to order rj~^f'^ one needs only bounds on finitely many terms {0{rj~^/'^) 
in the series for F, F , M, G , as explained above. For the remainder we use that it is 

— — 1/2 

bounded by the much smaller quantity {er]~'^y derived from the convergence of the 
series for dct _D (if /i in e = iirj" is small enough and c is large enough). The details are 
given in Appendix A2 below. 

We suggest to proceed however by, at first, assuming convergence of the series for 
F, F, M. G for erj^''' small enough, and only on a sccond rcading chcck that formal 
power series considerations (plus analyticity of det i') suffice: a technique used in [G3], 
§8. Here the difficulty is not the bounds but the cancellations and assuming convergence 
removcs uncssential worrics and clarifics the algcbra. 

We also take a = ^ in the following calculations to simplify notations: the general case 
is obtained by replacing the coefRcients explicitly appearing in the following formulae cj 
by ?7° and uj"^ by 772+° respectively. Wc set (p{t) = f^{t). 

For instance in the case (5.10) (with a = |) F^^^' = 4 + 0(?7^/^) by direct computation 
and, by (6.11), = Senrj-'^/'^e-i'"^'^^^ the leading term in (6.12) is: 

det£' = 247rr?-^/Ve-^''"''' (6.14) 
The reason why (6.13) holds is that if (jo'^= rj^f^: 

UJ U) L OJ 
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To provc (6.15) note that T,^^y J Al°yWoo{Ty) = E,?,y / ^SJyW'oo (t^o ) by the symmetry 
of O o; hence: 

Tn = 2 ^ y Ally, <y^ooiry) (6.16) 

Thinking (6.16) as sums of graphs wc sec that to cach graph with n nodcs contributing to 
Fii with the node y marked 1 (see definition prcccding (6.3)) as in (6.16) there correspond 
two graphs contributing, if y < wo, to Tf\ Namely the one with the node y marked O 
and the one obtained by deleting the mark O on y, adding a new node y' marked O on 
the hne coming out of y and with index Sy' = 0. U y = Vo we associate with it only the 
first of the abovc graphs. Note that the second graph has n + 1 nodes. 
Suppose that y is an endnodc in a graph i? associated with Fn: by A-kerncls definition 
we must evaluate, when computing the contribution to Fn from such graph, the quantity: 

Ooidiof) = -Ooidrdof) + -Ooi-^doof) - -^Oo{d2of) (6.17) 

OJ (jj CO 

where w ''="'^ ?7^/^(recall also that we take a = i) and having used di = -^{dr ^ V^a) — 
-^d2 (because the derivatives act on functions of spacial form F{(p{t), uj_t) with F{(p, a ) 
suitable) . 

Since = — 2woo we see that the second term is ^C'o(w^oo(t'i,.)9oo/): hcncc it appears 
in the cvaluation of the first among the two corrcsponding graph contributions to ^F^*^\ 
The second corrcsponding graph contribution to ^F^^-* will require computing: 

--Oo{woo{Ty,)dlfoOo{dof)) = -Ooivd^foOoidof)) (6.18) 

CO U) 

The combinatorial coefRcient associated with this graph is (n+1)! rather than n\: but one 
checks that this is what is necessary to verify that the difference between the contributions 
to Fii and ^F^°^ due to the three graphs considered requires computing: 

-Oo{dr,d^f -^dlfMdof)) - ^Oo(9o2/) (6.19) 

U) OJ 

Below we prove for all odd F{t) the following commutation relation: 

dr^, Oo{F) = 00 {dr,F - dyoOo{F)) (6.20) 
which, noting that below we only apply (6.20) for odd F's, allows us to rewrite (6.19) as: 

^dr^,Oo{dof)-^Oo{do2f) (6.21) 

Wc procccd by picking a new endnode (if any) and by rcpeating the construction until 
aU endnodes are exhausted. The difference between the contributions to Fn and -t\°^ 
considered so far will therefore be given by a coUection of graph values contributing to 
Fii with no marked node and with one of the endnodes requiring the evaluation of: 

-dr^,Oo{dof)-^Oo{d2o.f) (6.22) 

Consider now a l~marked node y which is next to an endnode; and suppose that there are 
p hnes hnking it to the endnodes. The differences between the contribution of the con- 
sidered graphs to Fn and of the corrcsponding ones to ^F^°^ plus the already evaluated 
differences due to the previously considered graphs will be, proceeding as before: 
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+ ^Oo[-^dlfoOo{dP+'fOoidofr)) - ^Oo[d2dt'fOoidofr)- (6.23) 



^ 1 



which is, by (6.20): 

-dr,Oo{9P+'fOo{dom - ^d20o{d^+'fOoidom (6.24) 

We proceed in the same way until we reach the first node and, in fact, we can treat in 
the same way also the first node except that in this case there is only one corresponding 
graph in F^"-*. No nccd, this tinic, of using the commutation relation (6.20) bccausc the 
Tu(j -integral is a simple integral not involving Oo operations any more. Hence eventually 
we find that the differences between the contributions to Fn and ^F^°^ add up to: 

where one should note that the first term in the l.h.s. vanishes: this does not immediately 
follow from it being an integral of a derivative because the integrals are improper: one 

chccks this by (5.1) and that Al^^ is even (at a = 0). 

In a similar way one shows thatMii = ^ (J^f ^ + ^^2^^ ) ■ Onc proceeds by writing F^'^'' as 
^1°'' = J2^yl ^vly'^oo{Tvo)j and marking O the first node for both F^'^'' and Mu. Then 

note that to each graph contributing to F^"^ with the node y marked 1 there correspond 
two graphs contributing to Mu also for y = vo, whcn Sv„ = 1, and they are constructed 
by foUowing the same prescription given after (6.16); moreover to each graph contributing 
to F^*^' with — O there correspond two graphs contributing to Mu, namely the one 
with the node vq marked O twice and the one obtained by adding a new node v'q (which 
becomes the new first node), marked O twice and with 5y'^^ — O, on the line coming out of 
y. Then proceed as before, with the only differencc that, at the last step, when dealing 
with the contributions arising from the graphs with Svo = O, one has to use the identity 
Co(woo simp) = -5^^ = ^fiT^ = -j'fi- This can be proved in the same way as 
the commutation relation (6.20). 

Let £ = ( ) and T = ( r^nfL)'- t^e vector T verifics T = L{t)T + F 



F J - \dtOo{F) 

where L{t) = (^^^^^ q^ • ^y differentiating with respect to t both sides we get T = 

L{t)T + P + ( n)</5Z! which means (6.20), up to a homogeneous solution 



t — > W{t)X, X B? oi the latter linear equation, when written for the first component; 
the oddness of F is necessary to check that X = 0. Choosing T = ^ ^ and F = O 
one gets in the same way the above mentioned identity (recall = —2wqo). A similar 

identity is obtained by choosing T = ( ) , useful in §7. 

V ^03 / 

This completes the proof that the cancellation discussed in §5 is in fact taking place to 
all orders of perturbation theory. 
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§7. The splitting in anisochronous cases. 

We now proceed to the analysis of an anisochronous case: 

H = n''A+ -^B + rf'^^ + ^l+g^ (cos<^ - 1) + £ /(^, a, A) (7.1) 

with e = fiT]" and c will be chosen large enough. This model belongs, see [T],[G2], to 
a wcU studied class of models ("Thirring models") and it is a simplified version of thc 
hamiltonian considered in [CG] in §12. If we added (7.1) a further "monochromatid^ 
term /o((/J, a, A) which has Fourier harmonics i/, for (a, A), multiples of a given fast 
harmonic (z. e. a harmonic with the component 1^2 different froin 0) it would offer 
all the difhculties of that case in spite of being much simpler analytically. We shall not 
attempt the analysis of such more general hamiltonian: the error in [CG] eliminated 
almost completely this problem which, hence, has to be studied again if one wants to 
recover the results of that paper. 

Consider, as an example, the case (5.10). This time the first problem is to find for how 
many values of A one can have, for e small, an invariant hyperbolic torus with rotation 
number: 

o; =(,,'' + r?2- A, ,7-1/2) (7_2) 
and e-close to an unperturbed one. 

For hamiltonians hke (7.1) the average position in the A variables of the torus with 
rotation (7.2) will be exactly A (this is a general property of "twistless tori" , as discussed 
in [G3],[G4]). The avcragc position in the B variables will be chosen so that the energy of 
the torus is a fixed value, e. g. 0: this can be done bccause the fast variable A conjugate 
to B is still isochronous (see §5 in [CG] for the niorc; general case in which also A is 
anisochronous when the analysis is somewhat morc involvcd). 

Of course lo must verify a diophantinc condition: but in view of using thc rcsult to 
show that the tori resisting the pcrturbation, i. e. the ones described by rcmark (7) to 
theorem 1 above, are dense enough to build long chains of tori joincd by hctcroclinic 
trajectories, ^'heteroclinic chains", we must consider tori that verify a very generous 
diophantine property, compared to (2.2). 

Since we shall not discuss the ambitious application attempted in [CG] we just allow 
rotations vectors verifying (2.2) with a fixed, possibly very largo, d. 

Prom lemmata 1,1' in §5 of [CG], all tori with rotation w =^ w {A) (rj" +r]^''A, rf^l'^) 
veriiying: 

\ui-v\> c-^A^lV^"^- (7.3) 

will survive the perturbation if the parameter c in the definition e = [irf of the coupling 
constant is large enough: so that eC{r])~'^ < eo for some eo and some q> O 
The splitting theory is "insensitive" , at given w, to the presence or absence of the 
isochrony brcaking term ^ in (7.1) providod A = ri^~°' ^^q> 0. We discuss this delicate 
point below, for general perturbation f, see (5.9). The homoclinic splitting is given by 
(6.14) with no extra leading terms. The only effect of the anisochrony is to introduce a few 
gaps in the foliations of phase space into stable and unstable manifolds: but it has also 
the advantage that we no longer must be careful about the values of ry. it Anisochrony 
guarantees that the diophantine conditions holds for "many" values of A. 

Turning to the main point of this section (and of the whole paper) we prove that usually 
the first order ("Melnikov's integral") dominates the splitting. Again the technique will 
be based on the general theory of [G3] . 



5//e!)!)roio/2008; 18:17 



20 



In thc anisochronous casc thc grapli labcls havc to be extended, scc (5.7), (5.14) and 
[G3]. On each node v one adds a further node label j„ = 0, 1 (which in the isochronous 
case would be jy = 0) and this has the effect that in the definition of A one replaces: 

p„ (7.4) 

dn "° f S ^ di ... dj f s if l' = va 

where py is the number of nodes Vi,...,Vp^ preceding v. Furthcrmorc thc kcrncls 
u'(t^, r„) become node dependent w„(t^, t„) and equal to w{T'^,Ty) if jy = O and (r^/ — t„) 
if jv = 1- 

The first component Xi{ui_t,t, a) of X ^ will not vanish. Lct us dcfinc a{t) as a + Lut + 
'12^=0 ^''-^1 i ^^'^^ s) where Xi is the first (and only non vanishing) component 2Li- 

The contributions to the splitting Qj due to fruitless trees will be 2 Al^ , with the 
same notations of §6. The full splitting will be: 

Qj=2j2[ < + E / ^i'i>.(Ty)GM(a) + .. (7.5) 

here [r] = O if r = 0, 1 and [r] = 1 if r = 2, 3, and: 

Wo{r) = woo{t), wi(t) = |wo3(t)|, W2{t) = 77, w^ir) = t]\t\ (7.6) 

with G = (G(°\G(i\G(2),C?(3)) representing the fruit values, defined as in §6, for fruits 
which now can carry also a label 2, 3 on the first node: the latter values correspond to 
the fruits carrying label = 1 (2 corrcsponds to a dry fruit and 3 to a ripc fruit): the 
choices of the W2, W3 arise from the form of the operator corresponding to O for the nodes 
with the new labels, called in [G3]. 

In complete analogy with §6 the G verify Dyson's equations. If we set: 



rM(a) = J2 / AtSM Mrs = J2 A^:tl:\<^Ml)r{Tvo){crwUTy) (7.7) 

i' 1?,!/ 

where the matrix a is defined to be ars = O except for the matrix elements aoi = (tiq = 

C23 = 0'32 = 1: 

GM(a) = rM(a) + J2 J (m)r(r„o)AH;W(ai^;),(r,)GW(a) + . . . (7.8) 

where the dots represent contributions from thc graphs with morc than onc fruit, whilc 
the terms explicitly written represent the contributions from the graphs with no fruits 
or with just one fruit. 

At the homoclinic point the derivatives Gj=dj,rj = djT verify exactly: 

G^= T^+aMGj, Gj = {l-aM)-^rj (7.9) 
(compare with (6.6)). 

Thc intcrsection matrix will be, setting as in §6, Tij = 2^^^^ J Al'^ y, i, j = 1,2 and 
G= (c7-c7M(t)-1: 

A,- = r,,- + 2(r ., aGj = r,,- + 2(r ., crj (7.10) 

The convergence of the above series , (7.5)-j-(7.8) and the estimate of their remainders is 
discussed as in §6: see Appendix A2. 
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The above equation is not sufBcicnt this timc: thcrc arc in fact too many variablos. 
There are however several relation between the inatrix elements of M and F , G . In fact 
M is symmetric for the same reasons as the corresponding matrix in §6: i.e. by using the 
symmetry of Oo and of 2^ operators defined in (5.14) (the symmetry of 2^ is proved in 

the same way as that of Oo)- If A'=^ '^^,ijo = r]"' the relations are, up to terms of order 
77°°: 

Mo3 = AMoi - AAo, M23 = AM12 - AAi (^-H) 
M33 = A^Mii, M31 = AMii 

where Ao = {2Lu)-^d2E I Mt,„)\AI and Ai = i2uj)-^d2E I K,\A%, and Z = 
i+AAi • '^^^ relations among the M elements are proved by the same argument discussed 
in §6 for the first of (7.11). One should use the relation Oo{woo\wo3\ simp) = — 2|fi;o3|, 
scc the final rcmark in §6 and note that Oo (sign t _F) (t) = signt Oo{F)(t). The constant 
Z arises solving by iteration (7.9): the structure of the matrix aM and of its powers is, 
given the relations between the Mij in (7.11), such that the first and third components 
of G are proportional via the constant \Z. 

Equation (7.11) allows us to reduce the size of the vectors G, T and of the ma- 
trix M. We shall denote with a tilde the new vectors and matrices. Introduce G = 
(G(0),G«,G(3)), f =(r(0),r(i),r(3)), andM,7Vas: 

/M10 + ZAM12 Mil AMii \ / O 1 A\ 

M = Maa + ZXMo2 Moi AMoi - AAo , ^ = \ ^ ^ ^ (7.12) 
\ M20 + ^AM22 M21 AM21 - AAi y \ ZA O O / 

The equations (7.9), (7.10) become respectively: 

G = (i-M)-ir, A, =r„ + 2(r„7V(i-M)-ir^-) (7.13) 

noting that the matrix G = A''(l — M)~^ is symmetric (because G in (7.10) is symmetric) 
and that it has the second and third row proportional one deduces that: 



det£> = riir22 + 2(r^'V(riiGii + 2(rf ^)2Aoo,ii)+ 

+ 2(rpV(riiG33 + 2(rf ))2Aoo,33) + 4r«rf (riiGi3 + 2(rl°VAoo,i3) 

where Aoo,ii, Aoo,33, Aoo,i3 denote the determinants of the matrices: 

Coo Coi \ f Qoo Co3 \ f Goo Go3 
Cio Cii J ' \ G30 G33 J ' \ Gio Gi3 



(7.14) 



(7.15) 



To compute all the above quantities we note that if we set (see (7.12)) a = Mu, b = 
Moi, c = M12 and x = + ZAM12 = b + ZXc, y = Moo + ZXMo2, z = M20 + ZXM22, 
Ao = -b', Al = -e', Z = {1 + b'){l - Xc')-^: 

(1 — X —a — Aa \ 

-y 1-6 -A&-A6' (7.16) 
—z —c 1 — Ac — Ac' / 

hence det(l - M) = - ((y + XzZ)a - (1 - x)^^ (1 - Ac') =^ A, and (1 - M)-i is: 

. / {1 - b){l - Xc) - Xbc a(l-Ac') Aa(l + 6') \ 

— y(l-Ac) + A^;6 (1 - a;)(l - Ac) - Aa^ {l-x)Xb + Xay (7.17) 
\ yc+{l-b)z {l-x)c + az {1 - x){l - b) - ay J 



5/febbraio/2008; 18:17 



22 



where b = b + b',c = c + e'. Thus the matrix C = N{1 - M)-i is: 

, / y{l - Xc') + \z{l + b') (1 - Ac') A(l - + 6') \ 
C=-\ - Ac') a(l - Ac') Xa{l + b') (7.18) 

A(l-a;)(l + 6') Xa{l+b') X^a{l+b')Z J 

Noting that Aoo.ii = -A'^l - Ac'), Aoo,33 = Z'^X'^Aoo,n, Aoo.is = -^AAoo.ii and 
Cii = (733 = {ZXfCn,Cu = ^ACn with a = Mu (not to be confused with a 

in (7.1)) we get, for a suitable /3 > O, for detD: 

r«r^|) + J",!'-iy (rnMn - 2(r(°))^) + 0(s3,-^e-f''-) 

= rli^fa^ + 0(£3,,-/3e-f ^-^'') = detD 

by thc argumcnt Icading to (6.12), (6.14) and having takcn thc parameter a in (7.1) equal 
to 5 (not to be confused with the a = Mu in (7.18)): this completes the analysis of the 
remarkable cancellations for separatrices spUtting in the anisochronous case. The leading 
order remains exactly the same as in the isochronous case: anisochrony only alters the 
final result by a factor of order (1 + 0(77")), as it should have been expected a priori once 
understood the isochronous case. 

The proof of the domination of the first order now foUows the same path as the cor- 
responding in §8 og [G3]: one uses the above results to treat the first 77"^/^ orders of 
perturbation theory and for the remainder one just use that the series for the sphtting 
is convcrgcnt. 

§8. Heteroclinic chains. 

For completeness we give the argument for the existence of heterochnic chains, following 
[CG], in the easy case of isochronous systems. Below we imagine to have fixed /x and to 

take 77 ^ O (so that e = 0{rj'^)). 

It is worth noting that no gaps {i.e. all actions A are the average position of an invariant 
torus) are present in the isochronous cases (2.1), which is, therefore, very similar to the 
original example proposed by Arnold (also gapless). 

Let Ao = O < ^1 < . . . < Aj^ = Ao and choose correspondingly Bo, . . . , Bj^ so that the 
sequence of action variables {Aj,Bj) describes the time averaged location of invariant 
tori for (2.1) with energy O (say). 

We consider a perturbation like (5.9) for which the splitting is given by Melnikov's 
formula a = 0{rf'e~^'^ ' ) for some b > Q. Since there are no gaps we can choose the 
sequence Ai can be chosen so that Aj+i — Ai < e~^^v ^ ^ for a prefixed ^ > 1 and for 
ah z's. Hence thc number J\f has size OiAge'^^i^ ^^^). 

We want to show that there are heterochnic intersections Hi between the unstable 
manifold of the torus A^ and the stable manifold of ^j+i. Since by construction 
the tori have the same energy this simply means finding a solution for the equations: 
X:^{Tr, a; A^) — X^(7r, a; A^J^^) = O (the energy being equal, this equality then im- 
phes X^{tt, a; A^) — X^{tt, a; A^_^-^) = O, i.e. also the pendulum momenta match). 

The tori equations depend linearly on their average positions, i.e. X_f{TT, a, A)"^-^ A + 
y ^(a) (see theorem 1) where is defined here. We can regard the equation for the 
heteroclinc intersection X^(7r, a: A^) — ^^'(vr, a; A^^i) = O as an implicit function 
problem which for A^_^_i = A^ has a j = O as a solution. 

The linearization of the equation at A^ involves the intersection matrix D a,t A^, a = O 
(which in the isochronous case is A-independent): 
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(8.1) 



showing that the implicit functions problem of dctcrmining thc hctcroclinic point a^ 
can be solved for small enough because detD = 0{r]^e~^^ ) and — = 

0(e-'5f''"'), with^>l. 

It might be surprising, at first, that the equation for a can be solved without an explicit 
estimate of the a -derivatives of the H | ( a ) at points a near O . Such estimates can 
be made directly from the existence theorem: however they give bounds on derivatives 
values that are much larger tha,n a, i. e. they have size 0{ri~^) for some > 0. This may 
seem to undermine the foundations of the implicit functions methods, that rely on the 
solubility of the linear equation. 

However the corrections to (8.1) are bounded by 0{r]~^ a'^): and \ A^j^t^ — < 
g-d-jj Thc solution of the linear cquation (8.1) has size 
Hence near such a the higher order corrections have roughly still size 

0(g-57r,) ): much 

smaller than the linear contribution. This shows that our knowledge of the smoothness 

of X sujfices, togcthcr with thc basic estimate on thc homoclinic angles, to dcducc that 
the linear approximation dominates and to claim that the solutions for the heteroclinic 
point do exist and are very close to those of (8.1). 

Therefore there is a chain of hctcroclinic points Hq, H^. . . . , -Ha/"-! "connccting" a neigh- 
borhood of Aq to one of A_nj-. The "length" of the chain is A/" = 0{e'^^^^ ^^^), i.e. in 
some sense it is the inverse of the splitting. 

We now consider the case in which the system in (7.1) is further perturbed by a 
monochromatic perturbation /?/o(A, <^). The radius of convergence of the whiskers se- 
ries in /?, /i (rccall that e = firj'^) can be shown to have size of order < 0{ri~^), 
< 0(1), see Appendix AlO of [CG]. 

The splitting det D is analytic in /?, /i for /3 <C r]~^/'^, see Appendix AlO in [CG], and it 
is diffcrcnt from O for |/3| <C 0{'rf).fJ, ^ O, providcd thc gcncrically non zero splitting at 
/3 = O is not zero. In the latter case the splitting can only vanish finitely many times, at 
H fixed, in the domain of convergence of the series if ^ 0. 

Hence for all values of /3 closc to 1 (including /3=1 gcncrically in /o), thcrc cxist 
heteroclinic chains as long as we wish. However not having an estimate for the size of 
the splitting we cannot infer from the above argument how long the chains are. 

Thc rcmark is intcrcsting in vicw of thc general fact, [A2], that hctcroclinic chains imply 
diffusion in phase space, i. e. existence of motions starting near the torus at one end of 
the chain and reaching the one at the other hand in due (finite) time, see [G5] for the 
discussion of thc mcthod of proof in [CG] . 

The extension of the above simple analysis is harder in the case of anisochronous systems: 
it was attempted in [CG] but it is invalid because it relies on the error mentioned above. 
For a rcvicw on diffusion scc [Lol]. It is likcly that thc study of this problem will rcquire 
using the techniques of Nekhorossev's theory, see [Lo2] for the best results available, and 
[CG], Appendix AlO. 

§9. Other results. Comments. 

(A) General theory o f splitting. 

We call diophantine with diophantine constants Co,r > O a vector w G such that: 

■ !l\> Co^IeI"", Vi/GZ, !/^0 (9.1) 

compare with (2.2) (here there is no extra parameter r]). We use the notations of §2 for 
the other symbols that are not redefined. 
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Theorem 3: Suppose that w G i? is diophantine and consider the hamiltonian: 

/2 

= ^ + w • A + T-p +5'^o(cos^ - 1) +/i/(a,^) (9.2) 

wiih { A, a'), (/,</?) heing £ pairs of canonically conjugated action-angle variables. Let f 
be a even trigonometric polynomial of degree N and, for simplicity, Jo < J\. Then: 

(1) the separatrix splitting, for the torus with rotation vector u into which the unper- 
turbed torus ^ = O evolves with ji, is analytic in ji near /i = 0. 

(2) the power series expansion of the splitting Q{a) in powers of fi has coefficients with 
0-average; their Fourier components Q^ are bounded, for any S < 1, by: 

,^k,^(JogD5-P{BS-f')>^-'klPe{k,u) , , . 

I ^ - I - \ JogDn-^iBf]-^)''-' - ^ - ^^■'^> 

where D, B are suitable constants and (3, p can be taken /3 = 4(A'' + 1), p = 4r + 4 ifr is 
the diophantine constant of u , and: 



h 

e{k,v)= max ([W\f^ \]e-a-'\!^'--L\(i-^)) (9.4) 

, -Ah j=i 



Remarks: (1) this theorem is the main rcsult of [G3]; note that (9.4) is stronger than the 
form in which it is quoted in eq. (6) of [RW] which refers to the theorem stated in [G3] 
but not to its proof (which gives in fact (9.4)). The method of proof in [G3] could yield 
in fact the resuh for / analytic by using the ideas in [GM] but extra work in necessary: 
see [DGJS],[RW] for alternative proofs; see also [BCG], where the stronger form (with / 
analytic) was derived, in a similar problem. 

(2) Thcre are many instanccs in which the first order expression (called Melnikov's 
integral) of the splitting vector Q^{a), gives in fact the leading behavior (as /i ^ 0) in 
the calculation of the splitting. In the case of fixed co, i. e. for the one time scale problem, 
this foUows from the classical results of Melnikov, [Mc] . 

(3) Another interesting question arises when w = 7c<Jq with lu_q diophantine and 7 a 
parameter that we let to 00: this is a two tim,e scales problem. In the case £ = 2 (hence 

is a constant ujo) and with / a trigonometric polynomial the above theorem proves 
that the splitting is (generically) 0(e~ 2» ^07); f^ct this result was the main purpose 
of the thcory in [G3] (see §8 in [G3] and, in particular, (8.6) and the rclatcd discussion). 
It should be stressed that the latter reference simply provides a new proof of a result 
already obtained, in a slightly different case, by [HMS] or, in the same case, by [Gl], 
[GLT], LST]. The interest of [G3] Hes in the tcchniquc. 

The Melnikov's "approximation" , i. e. the dominance of the first order value of the split- 
ting, is more delicate if > 3. The techniques of [G3] are inadequate to deal with this 
case and they only show that the splitting is smaller than any power of while the 
first order value is 0{e'^'^) for some (computable) c > 0: in the case i? = 3 this has been 
studied in [DGJS], [RW], where the (9.2) is improved by replacing £{k, y_) by the much 
bettcr e~i^ i\^a'iL\, Several cxamples of first order dominance are providcd in the 
latter references. However all examples are constructed with / analytic: it would be nice 
to find a model with a trigonometrical polynomial / for which the first order theory gives 
the asymptotic result. 

(B) Three time scales. Anisochrony strength. Homoclinic scattering. 

The three time scales condition for the first order dominance ("Melnikov's formulae") 
includes the case a = O which is in fact a 2 time scales problem: denoting always 
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thc fast velocity scale, from the analysis of §7 we see that the slow scale could be 77" with 
a > 0. 

This means that the above theory provides a class of models in which the Melnikov's 
formula gives the exact asymptotics as r; ^ O and the perturbation is a trigonometric 
polynomial, of which (5.10) is a concrete example. This does not seem to contradict 
the results of [RW] who show that the Melnikov's formula does not necessarily give the 
leading asymptotics as 77 ^ O in cascs corrcsponding to thcir n = 3, s — 2. 

In the only almost overlapping case a = O, however, the above question is not treated 
in [RW] (they consider the very different case n = 3,s = 2 in which a = — |, i. e. two 
fast rotators and a pendulum) . This illustratcs also that thcrc arc scvcral "2 time scales 
problems" , depending on which pair among the three time scales is identified. 

The value a = O is a case considered with other techniques in the paper [RW] (it 
corresponds to thcir n — 3, s — 1): thcrc tlic attcntion is dcdicatcd to a widcr qucstion 
(namely the leading order of the splitting everywhere on the section ip = n rather than just 
at the homoclinic point). Our asymptotic result is consistent with their theorem 2.1. We 
also gct the completc asymptotics in thc case of trigonometric polynomial pcrturbations: 
but they do not seem interested in this point and deal only with other cases (n = 3, s = 2 
and non trigonometric peturbations); their technigue seems to apply to our (special) case 
a = O as wcll (in fact a simpler case). 

The case a > O is not considered in [RW] except, perhaps, for a remark at the end of 
the abstract and following eq.(15): we do not know whether this case, that is explicitly 
excludcd in thc papcr, can be treated with thcir tcchniqucs. In thc cnd thc main diffcrcnce 
between our work and that of [RW] might just lie in the technique: we have shown that 
the work in [G3] provides all the necessary technical tools for the analysis of the splitting 
and cvcn Icads to an "cxact" cxprcssion for it. It is howcvcr limitcd to thc splitting at thc 
homoclinic point cp = n, a = O (unlike [G3], and [RW] where the splitting is measured 
at a arbitrary, on the section <f = w). 

Thc work [RW] is thc last in a scries of papers (like [DGJS]) which arc incxtricably 
linked with each other (and with [G3], [BCG]). The above comments therefore are easily 
presented in connection with [RW]: but we are fully aware of the role of the other papers 
quotcd in [RW]. 

Fixing a = i the anisochrony coefficient (of A"^) in (7.1) is t]^ with /3 = 2a. The value 
(3 > a is necessary if one wants that the anisochronous and the isochronous splittings 
coincide to leading order as 77 ^ O (at givcn rotators vclocitics): howcvcr thc analysis 
above does not seem yet sharp enough for such an improvement {i. e. taking /3 < 1). 
Finally the physical interpretation of the precession problem {i. e. diffusion in presence 
of a doublc rcsonancc for a a priori stablc systcm) rcquircs /3 = 1, a = ^. 

Extensions of the cancellations theory of §7 to f > 3 seem only a matter of patience. And 
they would be interesting as they can be conceivably used to treat a variety of systems 
and onc should expect that thc results will be quite different whcn thc numbcr of fast 
scales exceeds 1, as shown in the "maximal case" in which it is £ — 1 ([DGJS], [RW]). 
However a general theory of a priori stable systems, with a free hamiltonian without free 
paramctcrs and a perturbation consisting of terms of equal order of magnitude seems to 
reguire suhstantial new ideas. 

In §10 of [CG] there is also a statement about the homoclinic scattering: the techniques 
of this papcr apply to its theory as wcll. Wc havc not workcd out, howcvcr, thc corrc- 
sponding details {the statement was not used anywhere in [CGJ) and at the moment it is 
still an open question for us whether the homoclinic phase shifts are exponentially small 
or not at thc homoclinic point (as claimed in [CG] on the basis of the computational 
error mentioncd above). 

Appendix Al: Computation of the pendulum wronskian. 

The pendulum hamiltonian: H = P /2Jo+gQJo{cos(fi—l) generates a separatrix motion 
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t — > (p^{t) which is exactly computable. One finds, starting at v' = t at t = 0: 

siinp'^{t)/2 =1/ coshgot, sm(p°{t) =2 sinh (70^ (cosh^oO"^ 
cos<^°(t)/2 =tanhgot, cosip^{t) =1 — 2 (coshgio*)"^ 

A further elementary discussion of the pendulum quadratures near E = O, allows us to 
find the E derivatives of the separatrix motion and leads to: 



-2goJo sin ^, 9^7° = Jo(/°)-^ (l + Jo9o{dE^°) sin(^°) 



cosh^oi - . ^ . ^^^^^ 

/ = 4 arctg e~9«*, Sb^jO = (2got + sinh 250*) sin ^ 

cxhibiting the analyticity propcrtics in the complcx t plane that are useful in discussing 
the size of the homochnic angles. The (Al.l) allows us to compute the wronskian niatrix 
of the above separatrices, i e. the solution of the pendulum equation, namely ip = -j^, 
I = Jo5f^sin<^, linearized on the separatices: 

and we get: 

\I'^/C2 dEl"/ciJ' C2 = <^"(0) 

where the E derivative is computed by imagining motions close to the separatrix (which 
has energy E = 0) and with the same initial (p = n. This becomes: 

^ / TE^t 4jf^ \ 2got + smh2got 

The theory of the jacobian elliptic functions shows how to perform a complete calculation 
of the functions Ro,So in (3.2): see [CG], Appendix A9. 

Appendix A2: Convergence of the "form factors" G , F , F and remainders 
bounds. 

Integrals for T,T,M, see (6. 3), (6. 7), (6. 8) and the analogous ones in §7, are precisely 
the object of the analysis of Appendix Al of [G3]. Hence we adhere closely to it. 

Consider any of the form factors, i.e. any of the series in §6 or §7. Following [G3], word 
by word, we obtain a bound on the sum of the contributions of the values of all trees 
'd with m nodes and order h, m < 2h, as L'o-Bo"^^'*"^'^ ™^^o<| i/ |<iv/» | w • with 
Do,Bo constants (see (Al. 13) and (Al. 4) of [G3]: noting that in (Al. 4) m!^ is missing 
due to a typo). This factorial becomes /i!^ in the bound (8.2) of [G3] (and consequently 
on the form factors and det D at order h, which interest us here and which are expressed 
essentially by the same integrals) because m <2h and it is responsible of the first value 
4 in the variable called p, 'm [G3], p = 4 + 4t. 

We know from theorem 1 above, that detD is given by a convergent series but we do 
not know whether the series for F , F and for the matrix M converge. For the "dressed 
form factors" G one could in fact show that the series in s converge for small e by the 
method developed in Appendix Al of [G3] (consisting in reexpressing the G in terms of 
the unsplit operators O): but a lot of work can be saved because as we show now we do 
not need to know such convergence properties and the analyticity of det D sufHces. 
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Onc can find somc iniprovcmcnts ovcr tlic' bounds (8.2) of [G3] by using thc intcrmcdiate 
bound (Al. 13) in [G3] on thc integrals expressing the h-th order coefficient of detD: 
£>iS^~^£:''/i!*niaxo<|j^l<]Vfc I w • u\~^^ with D\,B\ constants. 

One can then bound small divisors by {ri~'^{hNY)'^^ ^ or rj' via (2.2): which 

accounts for part of the constant /3 and for the 4t in the value of the constant p = 4t + 4 
in the second bound in (8.2) of [G3]. It also accounts for Q in the first bound. 

In fact onc has a bcttcr bound as long as Nh < rj^^ bccause it is only for Nh > 1 
that one can find really small divisors: for N h < -yri^^, with any prefixed 7 < 1, we 
can bound | w • i^j below by rj^^"^, if rj is small enough. Hence for such values of h we 

have the bound: D\B^ {er]~^)^ without any factorial, with Di,Bi constants and for 
h < N~^ri"^j. This bound holds for all form factors and also for detD (note that it 
does not imply any convergence). 

On the other hand we know that det D is anal}rtic in e for £ < 77^^ and c large enough. 
Hence, to order h, det D (but a priori not individual form factors) can be bounded by: 

D2B^-^{er]-^f, for all /i > 1 {A2.1) 

for some c > 0. It was conjectured in [G3], see Appendix Al, that this bound could be 

obtaincd dircctly from thc graphical cxpansions. This has bccn provcd in [Gel] (gctting 
c = 4(i if d is the diophantine constant in (2.2)); the same proof shows convergence of 
the series in §6, §7 for the form factors and for M by bounding them, order by order, by 
(A2.1) (with different constants D2, B2): but we are showing that such stronger result is 
not needed here. 

Finally in the case of T^\T^\Ti2, i- e. in the case of the "bare" or "analytic form 
factors" , which arc cxprcsscd as integrals of analytic functions, onc can further improve 
the bound by the usual T„-variables integrations shift to ImT„ = ±i(f — »7^^^), choosing 
the quantity called d in [G3] as r/^^^, a natural but quite arbitrary choice. 

One checks dircctly (as explaincd in [G3], Appcndix Al) that this simply introduccs 
a factor r]~^ wth /3' = 2(A^ + 1) due to closeness of the singularities of the wronskian 
functions or of the /(<^(t^), wr^,) (located at the same places because / is assumed to 
be a trigonometric polynomial); it introduccs also an ciponcntially small factor. s h ~ 
mino<| J/ |<jv/(,o<|i/2| e~^'— so that the bound of the h-th. order, for such form factors 
(and hence for det D by §7) becomes : 

Dzhf Bli-Uer^-^f max e-J'^'^l, ioY &\\ h < MNr])-^ {A2.2) 
o<|i/|<JVM<l''2| i\ u \ J 

with (3 = 2{N + 1) + 2 and B-^^D^ constants. Note that in order to deducc this all 
we really need is that the exact expressions (6.12) and (7.10), regarded as formal power 
series, be true order by order in the expansion. We do not really need the (yet unknown) 
convergence of the series for the F 's and for the matrix M, which we think should be 
nevertheless true. 

Since {er]~'^)^ = {2er]~'^)^ 2~^ and for r} small enough and h > {Nr])~^^ the quantity 
2"'' is < e~^'^ ' the bounds (A2.2) and (A2.3) can be combined into the bound for 
the /i-th order value of det D: 

D4{Bier]-''fe-^i~'^^ for all /i > 1 {A2.3) 

with c — max(2(iV + 1) + 6, c) and B4, D4 suitable constants, immediate consequence of 
(A2.2) (and {u - !y\> r]'^''^ - Nhr]^''^ for V2 ^ 0). 
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